Amendments to the Specification: 

Please replace the Specification of the present application, including the 
Abstract, with the following Substitute Specification. A marked-up version of the 
Substitute Specification and Abstract is attached hereto. 
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SPECIFICATION 
TITLE OF THE INVENTION 
METHOD FOR GENERATING A BIT STREAM FROM AN INDEXING 

TREE 

[0001] The present disclosure relates to a method for generating a bit stream 
from an indexing tree as well as to a coding method and a decoding method which 
make use of the method for generating the bit stream. The invention further relates 
to a coding and decoding device. 

BACKGROUND 

[0002] It is often necessary to read out certain contents from a bit stream 
according to a query previously formulated by a user or to determine with regard to 
certain contents whether the contents are in fact contained in the bit stream at all. In 
this case, a query, defined by a user, can be formulated using a query language such 
as SQL (see reference [1]) or XPATH (see reference [2]). It is advantageous here if 
the entire bit stream does not have to be searched for the desired contents but 
instead if the information is stored in an indexing list or in an indexing tree, so that 
only that part of the bit stream in which the indexing tree or the indexing list is 
stored has to be searched. 

[0003] One problem of reading out data from a bit stream arises in the case 
of a document produced with the aid of the XML language (XML = Extensible 
Markup Language) and represented in the MPEG7 BiM format. With regard to the 
MPEG7 BiM format of an XML document, reference is made to [3] ISO/IEC 
15938-1 Multimedia Content Description Interface - Part 1: Systems, Geneva 
2002. Under this configuration the generated bit stream is subdivided into a 
plurality of units (access units) which consist of a plurality of fragments (fragment 
update units). The units are coded and where necessary sent in the form of an 
MPEG7 BiM stream to one or more recipients. 

[0004] With regard to the querying of information from XML documents, a 
multiplicity of query languages are already known which permit searches to be 
made in the document for specific information. Reference may be made at this 
juncture by way of example to the already mentioned. query language XPATH (see 
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reference [2]). The query language XPATH can be used to define selection criteria 
for filtering desired information within an XML document. In this case the purpose 
of a query can be to assess whether a unit of the bit stream is important for the 
recipient. Also a query can be used in a targeted manner to access specific desired 
information in the XML document. During the generation of the bit stream of an 
XML document the MPEG7 coding method has provided no mechanisms which 
enable random access to specific elements of the XML document. The MPEG7 bit 
stream must therefore be decoded for the purpose of searching for elements. A 
document in XML format is then obtained once again, which document can be 
searched by means of the query language XPATH. The decoding and subsequent 
processing of an XML document in order to search for specific contents is therefore 
very time-consuming, and thus unacceptable for certain time-critical applications. 
Furthermore the problem can arise that the memory in the decoder is limited, with 
the result that the bit stream cannot be fully decoded. In addition, the overhead 
involved in decoding was unnecessary if the XPATH query executed on the 
decoded XML document ends with a negative result. 

[0005] Within the framework of TV- Anytime (TV A), which is described in 
[4] TV- Anytime Specification Series S-3 on Metadata, Part B, Version 13, an index 
structure is used which permits random access to certain elements of a data 
fragment. The index structure consists of a plurality of parts and comprises what is 
referred to as a "key index list" in which all indexed paths of a document are stored. 
When a query is submitted, those paths are compared in turn with the query until a 
matching entry is found in the key index list. Based on the information that is 
stored in the key index list in relation to this entry, the positions in a description 
stream at which the indexed entry is present in coded form can be determined. As a 
result of using the key index list it is no longer necessary to decode irrelevant data 
fragments, so that less memory space is required during a query. However, the 
linear processing of the key index list is time-consuming and the transfer of all the 
indexed paths is laborious and resource-intensive. 

[0006] The document Lam S. W. et al., "Representing lexicons by modified 
trie for fast partial string matching", Character Recognition Technologies, San Jose, 
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1-2 Feb. 1993, Bellingham, SPIE, pages 229-237, describes a fast lexical search 
method wherein an input sequence can have both an indefinite length and also 
several non-specified letters. 

[0007] The document Wong R. K. et al., "An XML repository for molecular 
sequence data", Proceedings IEEE International Symposium on Bio Informatics 
and Biomedical Engineering", pages 35-42, describes a method wherein a large set 
of data can be efficiently searched with the aid of a "skip tree". 

BRIEF SUMMARY 

[0008] Under an exemplary embodiment, a method is provided for 
generating a bit stream from an indexing tree wherein the search for indexed data is 
made easily and efficiently possible. 

[0009] An indexing tree from which a bit stream is generated by means of 
the present method comprises a multiplicity of hierarchy levels, each hierarchy 
level being assigned one or more index nodes wherein the index nodes contain 
index data that is sorted in the indexing tree according to one or more 
predetermined criteria. Under this method, the index data of the index nodes is 
inserted into the bit stream and for each index node there is also inserted into the bit 
stream the information indicating at which position in the bit stream the index data 
of one or more index nodes of the hierarchy level, which lies below the hierarchy 
level of the respective node is located. As a result of storing the additional 
information relating to the index nodes in a lower hierarchy level a search for 
specific index data is considerably simplified since by this means branches to the 
index nodes relevant to the search are made possible. Considerably more efficient 
queries and searches for index data are therefore guaranteed. 

[0010] Under the exemplary embodiment the indexing tree is structured as 
what is referred to as a "B tree" (Balanced Tree) which ensures a balanced 
distribution of the data over the nodes of the tree. A detailed description of the B 
tree can be found in reference [5]. 

[001 1] The index data in the indexing tree can be sorted according to any 
criteria, lexicographically for example. 
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[0012] Index data is inserted into the bit stream according to the depth-first 
ordering principle. Through the use of depth- first ordering the index data in the 
indexing tree is inserted into the bit stream initially according to depth, as a result 
of which the items of information relevant to a query are arranged adjacent to one 
another in the bit stream and information that is not relevant can be efficiently 
skipped. A detailed description of depth-first ordering can be found in reference 
[6]. 

[0013] In an alternate embodiment, the index data comprises paths of a 
document structure tree which has a root node and a plurality of leaf nodes. The 
index data preferably also comprises the value instances of the paths and the 
positions of the value instances in the document which is represented by the 
document structure tree. In addition, the index data also includes the number of 
paths in an index node. 

[0014] In this case the paths can be absolute paths which start from the root 
node of the document structure tree and lead to a leaf node of the document 
structure tree. Under the embodiment, the paths are preferably relative paths, a 
relative path of where a respective index node being a path relative to a path, 
previously inserted into the bit stream, of the respective index node or of an index 
node of a hierarchy level above the hierarchy level of the respective index node. 
The use of relative paths enables commonalities in the paths to be used, for the 
paths of adjacent nodes usually have a common component. In this way, the 
amount of memory space required for storing the index data in the bit stream can be 
reduced. A further reduction of the memory space requirement can be achieved by 
inserting the paths of the index node whose index data is the first of a hierarchy 
level to be inserted into the bit stream in reverse sequence to the sequence in which 
the index data is arranged in the index node. This takes into account the fact that the 
index data at the end of the first index node of a hierarchy level has a greater 
similarity to the index node of the next-higher hierarchy level than the index data at 
the beginning of the first index node. Consequently a particularly effective coding 
results in certain cases through the use of relative paths. 
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[0015] The paths may also comprise description elements of an XML 
document (XML = Extensible Markup Language), where the paths being in 
particular XPATH paths of the XML document. 

[0016] The index data may also be coded in binary format by means of a 
coding method, in particular by means of an MPEG coding method. Under an 
alternate embodiment, the MPEG7 coding method is used. 

[0017] In addition to the above-described method for generating a bit 
stream the disclosure further comprises a method for coding a data structure 
wherein the data elements of the data structure are indexed in an indexing tree, with 
a bit stream being generated according to the inventive method and said bit stream 
being part of the coded data stream. The invention also comprises a method for 
decoding a data structure, the method being embodied in such a way that a data 
structure coded by means of the coding method just described is decoded. 

[0018] The invention further comprises a method for coding and decoding a 
data structure which comprises the above-described coding and decoding method. 

[0019] The invention also comprises a coding device by means of which the 
coding method according to the invention can be performed, as well as a decoding 
device by means of which the decoding method according to the invention can be 
performed. The invention further relates to a corresponding device for coding and 
decoding by means of which the above-described combined coding and decoding 
method can be performed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0020] The various objects, advantages and novel features of the present 
disclosure will be more readily apprehended from the following Detailed 
Description when read in conjunction with the enclosed drawings, in which: 

[0021] Fig. 1 illustrates an indexing tree used in the method under an 
exemplary embodiment; 

[0022] Fig. 2 illustrates a section from the indexing tree of Fig. 1; 

[0023] Fig. 3 illustrates a section from a bit stream which has been 
generated from the indexing tree of Fig. 1 by means of a method under the 
exemplary embodiment; 
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[0024] Fig. 4 illustrates a simplified indexing tree in which the index data is 
ordered lexicographically and represents paths of a document structure tree; 

[0025] Fig. 5 illustrates the indexing tree of Fig. 4, with some of the paths 
of the second hierarchy level being represented as relative paths; 

[0026] Fig. 6 illustrates the indexing tree of Fig. 4, with all the paths of the 
second hierarchy level being represented as relative paths; and 

[0027] Fig. 7 illustrates a variation of the indexing tree according to Fig. 4. 

DETAILED DESCRIPTION 

[0028] Figure 1 shows an example of an indexing tree used in the method 
according to the present disclosure. The tree is what is known as a B tree (where B 
stands for Balanced) which is often used for the indexing of data. The structure of a 
B tree is described for example in the already mentioned reference [5]. The B tree 
has a balanced structure and consequently permits searching with only a 
logarithmic overhead compared to the number of entries, whereas a linear overhead 
is required for searching by means of a list. 

[0029] The indexing tree contains a plurality of nodes 1 to 10, with one or 
more items of index data being stored in each node in the form of what are referred 
to as keys. In the present case the items of index data are XPATH paths of a 
description tree of an XML document. A description of XPATH paths can be found 
in the already mentioned reference [2]. The XPATH paths lead from the root node 
of the description tree of the XML document to the individual leaf nodes of the 
description tree of the XML document. XML elements with simple content and all 
XML attributes are used as leaf nodes in the description tree used in this 
embodiment. Consequently, the index data of the indexing tree shown in Figure 1 
comprises root-leaf paths of an XML description tree. 

[0030] In the indexing tree shown in Fig. 1, the XPATH paths are 
represented as a series of upper- and lowercase letters and are ordered 
lexicographically, with each letter standing for an element name in the XML 
description tree. For example, the path MDMVUFACN in node 1 means: 
MPEG7/Description/MultimediaContent/Vide 

/Accountltem/Costtype/Name. Also stored in the nodes in addition to the path are 
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the value instances in which this path occurs in the XML document. In the present 
case the path MDMVUFACN occurs twice in index node 1 with the instances 
"Total for Production" and "Broadcast". In contrast, the path MDMVUAFAc 
occurs only once in index node 2 with the instance "EUR". 

[003 1] The numbering of the index nodes in the indexing tree of Figure 1 
corresponds to the order in which the index data is inserted into the bit stream with 
the aid of the method according to the invention. In this case the order corresponds 
to what is known as a depth-first ordering scheme, as described for example in the 
already mentioned reference [6]. The insertion of the index data into the bit stream 
according to the depth- first ordering principle results in a lesser complexity during 
access to the index data, since information that is not relevant is skipped during the 
search for index data. 

[0032] Figure 2 shows a section from the indexing tree from Figure 1 with 
the aid of which the querying of index data in this tree will be explained using an 
example. In this case the query is whether the XPATH path 
MPEG7/Description/Multimediacontert 

emination/Disseminator/ Agent/Name contains the value instance "Discovery" (in 
short notation MDMVUADDAN = "Discovery"). Since the search string 
MDMVUADDAN is lexicographically less than the XPATH path 
"MDMVUFACN" of index node 1, the search refers back to index node 2. The 
search string is in turn lexicographically less than the entry "MDMVUAFAc" of 
index node 2. For this reason a transition is made to index node 3 located to the left 
of index node 2. Said index node contains two XPATH paths "MDMVUAAt" and 
"MDMVUADFh". The search string lies between these two XPATH paths, so in 
the next hierarchy level a transition is made to index node 5 which lies between 
index node 4 and index node 6. This index node now contains as its first element 
the searched-for XPATH path "MDMVUADDAN" as well as the searched-for 
value instance "Discovery". Thus, the matching value instance was found in index 
node 5 and the position of the corresponding element in the indexed XML 
document is then obtained. As a result of the fact that the index nodes are inserted 
into the bit stream according to the depth- first ordering principle, all further nodes 
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of the indexing tree are ignored or skipped during the search, so the search for 
elements in the indexing tree has reduced complexity. 

[0033] The insertion of index data into the bit stream is described in greater 
detail below, with reference to Figure 3, which shows a section from a bit stream 
generated using the presently disclosed method. In the bit stream shown in Figure 3 
the numbers of the index nodes are listed for the sake of clarity, although these 
numbers are not necessarily transmitted with the stream. The contents of index 
nodes 1 and 3 are shown in Figure 3. As the first element the nodes initially contain 
the number of entries, that is to say the number of XPATH paths stored in the node. 
This results here in the number 1 for index node 1 and the number 2 for index node 
3. Next, there follows the key for each entry, that is to say the respective XPATH 
path as well as the number of instantiations, i.e. the number of value instances that 
said XPATH path has in the XML document. For index node 1, for example, this 
results in the instantiation number 2 for the XPATH path "MDMVUAFCN", since 
the path having the two value instances "Total for Production" and "Broadcast" 
occurs in the XML document. Furthermore, for each value instance, the position of 
the value instance in the XML document is inserted into the bit stream. Also 
inserted into the bit stream, in addition to this information, is information 
concerning at which position in the bit stream - in particular at which position 
relative to the current position (offset) - the index data of index nodes of the next- 
lower hierarchy level that are connected to the respective index node is located. 
This enables nodes that are not relevant for a predetermined search query to be 
skipped. 

[0034] The nodes branching from one node into a lower hierarchy level are 
referred to below as child nodes. It should be noted with the disclosed method that 
the information relating to the position of the first child node of a hierarchy level 
does not need to be stored since the child node is read in next in the bit stream. 
Thus, as can be seen from Figure 3, only the position of index node 10 (referenced 
as "offset (10)") is stored for index node 1 and only the positions of the index nodes 
5 and 6 (referenced as "offset (5)" and "offset (6)") are stored for index node 3. If it 
transpires, for example, that the searched-for information for a query is 
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lexicographically greater than the XPATH in node 1, a branch is immediately made 
to node 10, so that it is no longer necessary to search index node 2 and the index 
nodes branching off from index node 2. By this means an effective search is made 
possible. 

[0035] The following table lists once again which information is stored per 
node in the bit stream. 
[0036] Table 1: 
Node { 

number of entries 
for each entry: 
Key 

number of instantiations 
for each instantiation: 

value instance 

Position 

for each child node except the first 
offset in the stream 



[0037] The bit stream generated by means of the method is transmitted to 
the recipient and can be read out in accordance with a query which was explained 
by way of example in relation to Figure 2. By means of comparisons between the 
XPATH to be queried and the entries in the index nodes specific node information 
is read out from the stream until the corresponding searched- for XPATH containing 
the searched-for value instance has been found or until no matching XPATH path is 
possible any longer on account of the sorting. In the latter case the desired 
information is not contained in the indexed document. 

[0038] Further embodiments of the method according to the inventions are 
explained below with reference to Figures 4 to 7, which illustrate simplified 
indexing trees with lexicographically ordered entries. Figure 4 shows an indexing 
tree which is to be coded and which comprises nodes Kl to K4. In this case, node 
Kl is referred to as a parent node and the nodes K2 to K4 branching off from node 
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Kl are referred to as child nodes. The entries are paths of a document structure tree. 
Instead of transmitting the paths with their entire length in the bit stream it is 
possible to store what are known as relative paths in the index node to be 
transmitted, said paths being relative in relation to a previously transmitted path of 
the parent node or in relation to a previously transmitted path of the index node 
currently to be transmitted. 

[0039] The insertion of relative paths into the bit stream is illustrated in 
Figure 5. In this case the notation "." signifies a step back in the path to which the 
relative path refers. In node Kl, the relative path "..E" in relation to the preceding 
entry ACB is transmitted for the entry "AE". In the child nodes Kl, K2 and K3, the 
relative paths in relation to the first path of the respective child node are transmitted 
in each case in the second and third entries. 

[0040] Under the embodiment of Figure 5, the full absolute path is 
transmitted for the first paths AA, ACC and AEF of the child nodes K2, K3 and K4 
respectively. It is also possible also to transmit the first paths of the child nodes as 
relative paths in relation to a path of the parent node Kl. This is shown in Figure 6. 
In this case, instead of the paths AA and ACC of the first and second child nodes, 
the relative paths M ..A" and ".C" in relation to the first path of the parent node are 
transmitted. Instead of the path "AEF" of the third child node, the relative path "F" 
in relation to the second path of the parent node is transmitted. In this case the paths 
are preferably stored in the bit stream in their byte representation. 

[0041] As a result of the use of relative paths the data volume in the stream 
is substantially reduced because commonalities in the paths are made use of when 
the paths are stored, with the result that less memory space is required. In 
particular, with relative paths the respective common components of the paths only 
have to be transmitted once. 

[0042] In a further embodiment, a data saving is achieved in that in the 
arrangement of the entries of the index nodes for the first child node K2 a reverse 
sequence is used compared to Figure 4. The result of this is that the entry which has 
the greatest similarity to the preceding parent node Kl is located at the start of the 
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child node K2. Thus, with special embodiments of the indexing tree this can lead to 
substantially shortened relative paths and a reduced memory requirement. 

[0043] The paths can be the already mentioned XPATH paths of an XML 
description tree. Furthermore the paths can be binary coded, by means of an 
MPEG7 coding method for example. In particular, the binary coding methods 
specified in the already mentioned reference [3] can be employed. Compared with 
textual representations, the use of binary coded paths can lead to savings in the 
number of bits to be transmitted. 

[0044] The above described description and drawings are only to be 
considered illustrative of exemplary embodiments, which achieve the features and 
advantages of the invention. Modifications and substitutions to specific process 
conditions and structures can be made without departing from the spirit and scope 
of the invention. Accordingly, the invention is not to be considered as being 
limited by the foregoing description and drawings, but is only limited by the scope 
of the appended claims. 
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